AITopics | statistical capacity

On the Statistical Capacity of Deep Generative Models

Tam, Edric, Dunson, David B.

arXiv.org Machine Learning, Jan-13-2025

Deep generative models are routinely used in generating samples from complex, high-dimensional distributions. Despite their apparent successes, their statistical properties are not well understood. A common assumption is that with enough training data and sufficiently large neural networks, deep generative model samples will have arbitrarily small errors in sampling from any continuous target distribution. We set up a unifying framework that debunks this belief. We demonstrate that broad classes of deep generative models, including variational autoencoders and generative adversarial networks, are not universal generators. Under the predominant case of Gaussian latent variables, these models can only generate concentrated samples that exhibit light tails. Using tools from concentration of measure and convex geometry, we give analogous results for more general log-concave and strongly log-concave latent variable distributions. We extend our results to diffusion models via a reduction argument. We use the Gromov-Levy inequality to give similar guarantees when the latent variables lie on manifolds with positive Ricci curvature. These results shed light on the limited capacity of common deep generative models to handle heavy tails. We illustrate the empirical relevance of our work with simulations and financial data.
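The light-tail claim in the abstract above can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' construction or experiment: it assumes the generator is a 1-Lipschitz map (a ReLU network whose weight matrices are rescaled to spectral norm 1), and the helper names `make_weights`, `lipschitz_generator`, and `tail_fraction` are hypothetical. It relies on the standard Gaussian concentration inequality, which bounds the probability that a 1-Lipschitz function of a standard Gaussian vector deviates from its median by more than k by 2·exp(-k²/2), and compares the generator's output against a heavy-tailed Student-t sample.

```python
import numpy as np


def make_weights(rng, sizes):
    """Random weight matrices, each rescaled to spectral norm 1 (assumption of this sketch)."""
    weights = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        w = rng.standard_normal((n_in, n_out))
        w /= np.linalg.norm(w, 2)  # ord=2 on a 2-D array is the largest singular value
        weights.append(w)
    return weights


def lipschitz_generator(z, weights):
    """Push latent vectors through a ReLU network and return its scalar output.

    Linear maps with spectral norm 1 and ReLU are all 1-Lipschitz,
    so their composition is 1-Lipschitz as well.
    """
    h = z
    for i, w in enumerate(weights):
        h = h @ w
        if i < len(weights) - 1:
            h = np.maximum(h, 0.0)
    return h[:, 0]


def tail_fraction(x, k):
    """Fraction of samples lying further than k from the sample median."""
    return float(np.mean(np.abs(x - np.median(x)) > k))


rng = np.random.default_rng(0)
n, d = 100_000, 16

weights = make_weights(rng, [d, 32, 32, 1])
z = rng.standard_normal((n, d))        # Gaussian latent variables
generated = lipschitz_generator(z, weights)
heavy = rng.standard_t(df=2, size=n)   # heavy-tailed reference sample

for k in (2, 4, 6):
    bound = 2.0 * np.exp(-k**2 / 2.0)  # Gaussian concentration bound for 1-Lipschitz maps
    print(k, tail_fraction(generated, k), tail_fraction(heavy, k), bound)
```

Under these assumptions the generated sample's tail fractions should stay below the concentration bound, while the Student-t sample keeps a visible share of its mass far from the median. Rescaling the weights changes the Lipschitz constant and hence the scale of the bound, but not its Gaussian-type decay.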

artificial intelligence, machine learning, random vector, (16 more...)

arXiv.org Machine Learning

2501.07763

Country: North America > United States > California (0.46)

Genre: Research Report > New Finding (0.34)

Industry: Banking & Finance (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

The VC-Dimension versus the Statistical Capacity of Multilayer Networks

Neural Information Processing Systems, Apr-6-2023, 19:23:16 GMT

A general relationship is developed between the VC-dimension and the statistical lower epsilon-capacity which shows that the VC-dimension can be lower bounded (in order) by the statistical lower epsilon-capacity of a network trained with random samples. This relationship explains quantitatively how generalization takes place after memorization, and relates the concept of generalization (consistency) with the capacity of the optimal classifier over a class of classifiers with the same structure and the capacity of the Bayesian classifier. Furthermore, it provides a general methodology to evaluate a lower bound for the VC-dimension of feedforward multilayer neural networks. This general methodology is applied to two types of networks which are important for hardware implementations: two-layer (N - 2L - 1) networks with binary weights, integer thresholds for the hidden units and zero threshold for the output unit, and a single neuron ((N - 1) networks) with binary weights and a zero threshold. Here W is the total number of weights of the (N - 2L - 1) networks.

classifier, multilayer network, statistical capacity, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (1.00)

The VC-Dimension versus the Statistical Capacity of Multilayer Networks

Ji, Chuanyi, Psaltis, Demetri

Neural Information Processing Systems, Dec-31-1992

The former characterizes their "Present Address: Department of Electrical Computer and System Engineering, Rensselaer Polytech Institute, Troy, NY 12180.

generalization, threshold, vc-dimension, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > Rensselaer County > Troy (0.24)
North America > United States > California > Los Angeles County > Pasadena (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.50)

The VC-Dimension versus the Statistical Capacity of Multilayer Networks

Ji, Chuanyi, Psaltis, Demetri

Neural Information Processing Systems, Dec-31-1992

The former characterizes their "Present Address: Department of Electrical Computer and System Engineering, Rensselaer Polytech Institute, Troy, NY 12180.

artificial intelligence, machine learning, vc-dimension, (16 more...)

Neural Information Processing Systems

Country: North America > United States > New York > Rensselaer County > Troy (0.24)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.50)
